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Abstract 



f\l ■ In this paper, we study the exploration / exploitation trade-off in cel- 

lular genetic algorithms. We define a new selection scheme, the centric se- 
lection, which is tunable and allows controlling the selective pressure with 
a single parameter. The equilibrium model is used to study the influence 
of the centric selection on the selective pressure and a new model which 
takes into account problem dependent statistics and selective pressure in 
order to deal with the exploration / exploitation trade-off is proposed: the 
punctuated equilibria model. Performances on the quadratic assignment 
problem and NK-Landscapes put in evidence an optimal exploration / 
^- ' exploitation trade-off on both of the classes of problems. The punctuated 

C**) , equilibria model is used to explain these results. 

\o 

1 Introduction 

^^ I The exploration/exploitation trade-off is an important issue in evolutionary 

computation. By tuning the selective pressure on the population, one can find 
an optimal (or near-optimal) tradeoff between exploitation and exploration. In 
cellular Evolutionary Algorithms (cEAs), the population is embedded on a bidi- 

k> \ mensional toroidal grid and each solution interacts with its neighbors thanks to 

j_j ■ a certain neighborhood. The convergence rate of the algorithm is then depen- 

dent of the shape and size of the grid and of the neighborhood. The smallest 
symetric neighborhood that can be defined is the well-known Von Neumann 
neighborhood of radius 1. It guarantees a slow isotropic diffusion of genetic in- 
formation through the grid. But when solving complex multimodal problems, it 
is necessary to slow down even more the propagation speed of the best solution 
because the algorithm still often converges over a local optimum. 

Our goal in this paper is to establish a relation between the selective pressure 
on the population and the effects of recombination and mutation operators, 
in order to find an optimal exploration/exploitation trade-off. To do so, we 
propose a new selection scheme able to control the selective pressure and a 
theoretical model which takes into account the effects of stochastic variations 
on an optimization problem. In section 2 we define a selection scheme able to 



tune the selective pressure and present the algorithm used in the experiments. In 
section 3, we analyze the selective pressure with respect to the selection method 
and present a new model which takes into account the stochastic variations. In 
section 4 we present performances on Quadratic assignment problem instances 
and on NK-Landscapes and we explain the results with the model proposed. 

1.1 Cellular Evolutionary Algorithms 

A cellular Evolutionary Algorithm (cEA) [23] is an EA in which the population 
is embedded on a bidimensionnal toroidal grid (see figure [1]). Each cell of the 
grid contains a solution. Embedding the solutions on a grid allows defining a 
neighborhood between the cells. The most commonly used one in cEAs is the 
Von Neumann neighborhood (shown on figure [T]). At each generation, every 
cell on the grid is updated by selecting parents in its neighborhood and ap- 
plying stochastic operators such as crossover and mutation. Several strategies 
exist, synchronous and asynchronous, to update the cells. The small overlapped 
neighborhoods guarantee the diffusion of solutions through the grid 18 . Such 
algorithms are especially well suited for complex problems !9] and are of advan- 
tage when dealing with dynamic problems |20j . 



Figure 1: Representation of a cEA and Von Neumann neighborhood in dashed 
line. 



1.2 Selective pressure 

One of the main properties that differs between EAs and cEAs is the rate of 
convergence (propagation speed of the best solution) : It is exponential for EAs 
and quadratic for cEAs. Therefore, the selective pressure on the population is 
weaker for a cEA than for an EA. Controlling the selective pressure is critical 
since it can avoid premature convergence of the algorithm when solving complex 
multimodal problems. Several parameters related to the selective pressure can 
prevent the algorithm from getting stuck in a local optimum. The topology 
of the grid, the local neighborhood, the properties of the selection operator are 
such parameters. By correctly tuning these parameters for a given problem, one 
can find a good exploration/exploitation tradeoff and minimize the risks of pre- 
mature convergence. Sarma et al. established a link between the radius of the 



neighborhood and the radius of the grid : changing this ratio directly affects the 
selective pressure on the population [16J . Alba et al. analyzed performances of 
cEAs with a fixed size neighborhood and different grid shapes. They arrived to 
the conclusion that thin grids are well-suited for complex multi modal problems 
and large grids are well-suited for simple problems. The main explanation is 
that thinner grids give lower selective pressure [3j. Takeover times and growth 
curves analysis are useful to measure the selective pressure on a population, but 
it is not sufficient to decide of a trade-off between exploration and exploitation: 
it is necessary to include effects of the stochastic variations due to the operators 
in the analysis. Janson et al. proposed a hierarchical cEA which allows achiev- 
ing different levels of exploration /exploitation tradeoff in distinct zones of the 
population simultaneously [8]. 

A standard technique to study the induced selective pressure without in- 
troducing the perturbing effect of variation operators is to let selection be the 
only active operator, and then monitor the number of best solution copies in 
the population [BJ. The takeover time is the time needed for one single best 
solution to colonize the population with selection as the only active operator. 
Let A be the size of the population, t the number of generations and N(t) the 
number of best solution copies at generation t. The population is initialized 
with one solution of good fitness and A — 1 solutions of null fitness. Since no 
other evolution mechanism but selection takes place, the good fitness solution 
spreads over the grid. The takeover time is then the smaller time t such that 
N(t) = A. Analysing the growth of N(t) as a function of t also gives an indi- 
cation on the selective pressure. It shows the convergence rate of the algorithm 
when selection is the only active operator. When the slope of the growth curve 
of N(i) is low, the convergence rate is low and the takeover time is high. On 
the other hand, a high slope of the growth curve leads to a short takeover time. 
In the first case, the selective pressure on the population is weaker than in the 
second case. 

1.3 Growth curves and takeover time models 

Characterizing the growth curves and the takeover time is an important issue 
in the study of the selective pressure [SJ . Many models have been proposed to 
define the behaviour of structured population evolutionary algorithms. Sarma 
and De Jong proposed a logistic model in which the coefficient of the growth 
curve of the best solution is shown to be an inverse exponential of the ratio be- 
tween radii of the neighborhood and the underlying grid |16) . This conclusion 
was guided by an empirical analysis of the effects on the convergence rate and 
the takeover time of several neighborhood sizes and shapes. Sprave proposed a 
hypergraph based model of population structures and a method for the estima- 
tion of growth curves and takeover times based on the probabilistic diameter 
of the population |19j . Gorges-Schleuter proposed a study about takeover time 
and growth curves for cellular evolution strategies. She obtained a linear model 
for ring populations and a quadratic model for a torus population structure [7] . 
Several authors wrote about theoretical or empirical models of growth curves 



and takeover time. Giacobini et. at proposed a model for cellular evolutionary 
algorithms with asynchronous update policies [4]. He summarized his results 
and proposed models for synchronous updates [5] that will be evoked later in 
this paper. Alba proposed a model for distributed evolutionary algorithms con- 
sisting in the sum of logistic definition of the component takeover regimes j2] . In 
his paper, he made an interesting review of existing models and compared two 
of them (the logistic model and the hypergraph model) with his newly proposed 
one. For a detailed state of the art of cEAs, see pQ. 

2 Centric selection 

In this section we present a new selection scheme for cEAs that allows tuning 
accurately the selective pressure. 

Algorithm 1 Centric Selection algorithm 
CentricSelectionindcx: int, j3: double 
neighbors < — GetNeighborhood(index) 
candidate^ i — Select (neighbors, 0) 
candidate? < — Select (neighbors, j3) 
Hest(candidatei, candidate?) 

The centric selection (CS) idea is to change the probability of selecting the 
center cell of the neighborhood. This scheme allows slowing down the conver- 
gence speed while keeping an isotropic diffusion of good solutions through the 
grid. The CS is a determinist tournament selection. But unlike the standard 
deterministic tournament, cells in the neighborhood may have different proba- 
bilities of being selected for the competition. The anisotropic selection [17] is an- 
other selection scheme which modifies the probability of selection a cell for a de- 
terministic tournament. With the anisotropic selection, the diffusion of solutions 
is not isotropic, so we propose the CS which is easier to study. We have p c = fi 
the probability of selecting the center cell and p„ = p s = p e = p w = j(l — j3) 
the probability of selecting either north, south, east or west cell. When /3 = |, 
all cells have the same probability of being selected for the competition: this 
particular case of CS is the standard binary tournament selection. When (3 = 1, 
only the center cell can be selected for the tournament: in this particular case 
where the same solution is selected two times, the crossover operator is not ap- 
plied in the cEA. Only mutations are applied to the solution, and with an elitist 
replacement strategy, the algorithm behaves as the parallelisation of as many 
hill climbers as there are solutions in the population. The CS is described in 
algorithm [1] The candidates compete in a deterministic tournament returning 
the best one. For each cell on the grid, two parents are selected per generation, 
as we can see in the algorithm [2] Stochastic variations operators are applied to 
the parents, generating two children. The replacement strategy is elitist: the 
best child replaces the current solution on the grid if it has a better fitness. The 
use of a temporary grid is necessary for a synchronous update of the cells. 



Algorithm 2 Description of our cEA 



cEApopulation: vector, j3: double 
tcmpGrid: vector 
while continue() do 

for i = 1 to GridSize do 
parenti < — CcntricSelection(i, /3) 
parent2 < — CentricSelection(i, /3) 
(childi, childi) < — Crossover (parenti, parents)) 
Muta,te(childi) 
Muta,te(child 2 ) 

tempGrid[i] < — Hest(population[i], childi, child®) 
end for 

Replace(population, tempGrid) 
end while 



3 Modeling cEAs 

In this section, we present two models of the search dynamic in cEAs. In the 
first one, the Equilibrium Model (EM) we consider that the optimal solution 
has been found and observe how it colonizes the grid. This model is classical in 
the studies on the selective pressure and one the exploration/exploitation trade- 
off. The informations given by this model are takeover times and best solution 
growth curves. As the stochastic variations operators are not taken into account, 
the same dynamic occurs in experimental runs when the recombination and 
mutation operators are ineffective: when the system has reached an equilibrium. 
In the second one, we consider that a better solution can be found with a 
certain probability and observe the frequency of apparition of this new solution 
with respect to our algorithm's parameters. It is a model of the transition 
between two periods of fitness stability. We call this new model the Punctuated 
Equilibria Model (PEM). 

3.1 Equilibrium model 

In order to measure the selective pressure induced by the CS, we observe what 
happens when no more solution improvement is possible. In this case, crossover 
and mutation are no longer useful and the evolution process has reached an 
equilibrium. Hence, we observe the time needed for a single best solution to 
conquer the whole grid, and look at the growth curve obtained and the takeover 
time. 

We measure the effects of CS on selective pressure by observing these growth 
curves and takeover times on a square grid of side 64. Figure [2] shows the 
takeover time as a function of j3. The takeover time is not defined for j3 = 1. 
The selective pressure drops when the value of (3 increases. We can see on figure 
[3] the growth of the number of copies of the best solution in the population 
(top) and its growth rate (bottom). There are two stages in the shape of the 
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Figure 2: Average takeover time as a function of /3. 



curve. The growth rate is linear in the first part and quadratic in the second 
part. When using this selection scheme, the diffusion of the best solution is 
still isotropic. So the best solution roughly propagates describing an obtuse 
square as long as no side of the grid is reached. This corresponds to the first 
part of the growth rate curve. Once the sides are reached by some copies of the 
best solution, the dynamic changes as we can observe on the second part of the 
growth rate curves. 

3.2 Punctuated equilibria model 

In this section, we propose a new model which will help in the understanding 
of the search dynamics of an Evolutionary Algorithm. This model was first 
designed for a cellular EA but can be easily extended to any kind of evolution- 
ary algorithm. We consider a cEA initialized with random solutions. We make 
sure that the best solution in the population is unique. Our goal is to simulate 
an evolutionary run: We simulate recombination and mutation operators with 
probabilities that the mating is efficient or not (i.e. produces a new best solu- 
tion). We consider three different types of matings: between two copies of the 
best solution (mating 11), between one copy of the best solution and one sub- 
optimal solution (mating 01) and between two sub-optimal solutions (mating 
00). We introduce probabilities Pn, Poi and Poo that matings of type 11, 01 
and 00 produce a new best solution, fitter than the previous best one. Figure H] 
is an example of evolutionary run on some optimization problem (minimisation 
task). We can see that there are some stagnation periods where the best so- 
lution don't improve. Then, an amelioration occurs and the population enters 
another stability period. An evolutionary run is a sum of stagnation periods 
and punctual improvements. Our punctuated equilibria model computes the 
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Figure 3: Growth curves of N(i) (top) and its growth rate (bottom) for different 
values of (3. 



probability of improving the best solution in the population according to the 
variables described above. 

With this model, the probability of finding a new best solution at a given 
generation t is : 

P (t) = 1 - (1 - P o)" oo(t) (l - Poi)" ol(t) (l - Pn) nil{t) 

where noo(t), noi(t) and nn(i) are the number of matings of each type for the 
generation t. 
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Figure 4: Example of evolutionary run 

The average time to find a new best solution is given by : 

E = Y J tp{t) 

4>1 

The performance of an algorithm can be measured by the time E needed to find 
a new best solution but also by the probability P of improvement in a preset 
time T. We have the probability of improving the best solution in T generations 



p = i-IL=i(i -*>(*)) 

P = l-(l-P o) Soo(T) (l--Poi) Enl(T) (l-Ai) Ell(T) 

with Ey(T) = Ylt=i n ij(t) the sum over T of mating of each type. 

The parameters Py are problem dependent and the values of £y are given 
by the selection scheme used. The selection process is usually controlled by 
a parameter such as the tournament size or in the case of the CS: f3. This 
parameter should be used to maximize the probability P. Intuitively, the ideal 
selection process maximizes the Ey which have the higher Py . More precisely, 
assuming that the control parameter of the selection process is /3, the parameter 
/3* which maximizes the probability P(T) verifies: 



gc/n = o 



dp 



which gives: 



1 In the following equations, we only denote the dependance on /3 for P and Sjj for read- 
ability. 



log(l-P 00 )^(/3*) 
log(l-P 01 )^(/3*) 
log(l - Pn)^f(/3*) = 



log(l-P 01 )ff(/3*) (1) 



If it is possible to have a model of £y (/3) , it would be possible to calculate 
the optimal /3 as a function of Py . 

In this model, the exploration/exploitation tradeoff is given by the number 
of each possible matting (00, 01 and 11). The model could be used to explain 
the probability and the time to find a new best solution according to the selec- 
tive pressure, and also to tune the value of parameters which have an impact 
on the selective pressure, such as j3, to have the highest probability to evolve 
toward a new best solution. Equation [TJ gives precisely the best exploration and 
exploitation tradeoff and allows computing the optimal value of j3 (in our case) 
for this trade-off. In the following, we will show the validity of the PEM on 
some optimization problems. 

4 QAP and NK landscapes 

In this section, we study the effect of selective pressure on performances through 
experiments of a cEA with CS on two well-known classes of problems. The 
optimal exploration / exploitation tradeoff found will be explained thanks to 
the PEM presented in the previous section. 

4.1 Problems presentation 

The problems proposed, Quadratic Assignment Problem and NK landscapes, 
are known to be difficult to optimize. The important number of instances of the 
Quadratic Assignment Problem and the tunable parameters of the NK land- 
scapes allow managing the difficulty of the problems. 

4.1.1 Quadratic Assignment Problem 

This section presents the Quadratic Assignment Problem (QAP) which is known 
to be difficult to optimize. The QAP is an important problem in theory and 
practice as well. It was introduced by Koopmans and Beckmann in 1957 and 
is a model for many practical problems [TTJ . The QAP can be described as the 
problem of assigning a set of facilities to a set of locations with given distances 
between the locations and given flows between the facilities. The goal is to place 
the facilities on locations in such a way that the sum of the products between 
flows and distances is minimal. 

Given n facilities and n locations, two n x n matrices D = [dki] and F = [fij] 
where dki is the distance between locations k and / and fij the flow between 
facilities i and j, the objective function is : 



® = '52'52 d p(i)pti)fk 



where p(i) gives the location of facility i in the current permutation p. 
Nugent, Vollman and Ruml proposed a set of problem instances of different 
sizes noted for their difficulty |14j . The instances they proposed are known to 
have multiple local optima, so they are difficult for an evolutionary algorithm. 
The best algorithm known is the fast hybrid evolutionary algorithm 13J which 
combines an evolutionary algorithm with an improvement of the fast tabu search 
of Taillard. 

Set up 

We use a population of 400 solutions placed on a square grid (20 x 20). Each 
solution is rcprensented by a permutation of TV where N is the size of a solution. 
The algorithm uses a crossover that preserves the permutations: 

• Select two solutions p\ and pi as genitors. 

• Choose a random position i. 

• Find j and k so that pi(i) = P2O) and P2(i) = Pi(k). 

• exchange positions i and j from p\ and positions i and k from pi . 

• repeat iV/3 times this procedure where N is the size of an solution. 

This crossover is an extended version of the UPMX crossover proposed in 
[12] . The mutation operator consists in randomly selecting two positions from 
the solution and exchanging those positions. The crossover rate is 1 and we 
do a mutation per solution. We perform 200 runs for each tuning of the two 
selection operators. An elitism replacement procedure guarantees the solutions 
stay on the grid if they are fitter than their offspring. 

4.1.2 NK landscapes 

The NK landscapes were proposed by Kaufmann to model the boolean net- 
work and used in optimisation in order to explore how epistasis is linked to the 
ruggedness of search spaces [TU] . Epistasis corresponds to the degree of inter- 
actions between the "loci" of a solution and ruggedness is the number of local 
optima of the search space. The main characteristic of NK Landscapes is that 
they allow tuning the epistasis level with a single parameter K. The parameter 
N determines the length of the solutions. 

The fitness of solutions for a NK landscape is given by the function 

/ : {0, f } N -> [0, f] 
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defined on binary strings of length N. Each binary string is a solution with 
N locations. An atom with fixed epistasis level is represented by a fitness 
component 

/ 2 :{0,1} A ' +1 ^[0,1] 

associated to each bit i. It depends on the value of the bit i and on the value of 
K other bits of the string (K must fall between and N — 1). The fitness f(x) 
of x £ {0, 1}^ is the average of the values of the N fitness components fi : 

1 N 
f(x) = —^2f l (x l ,x ll ,...,x ik ) 

i=\ 

where {i\, ..., i k } C {1, ...,i — l,i + 1, ...,N}. Many ways have been proposed 
to choose the K other locations from the N of the solutions. The mainly used 
ones are adjacent and random neighborhoods. With the first one, the K near- 
est locations of the location i are chosen (the solution is taken to have periodic 
boundaries). With the random neighborhood, K locations are randomly se- 
lected from the solution. Each fitness component fi is specified by extension, 
ie a random number J/i ( X4)X4I Xjfc ) from [0, 1] is associated with each element 
(Xi, xn, ..., Xik) from {0, 1} K+1 . Those numbers are uniformly distributed in the 
interval [0, 1]. 



Set up 

The size of the population is 400 solutions. The crossover used is a one point 
crossover, applied with a probability of 1. The mutation is a bit flip applied 
with a probability of — where n is the size of a solution. We perform 200 runs 
for every parameter set, and each run stops after 1500 generations. Runs are 
performed on instances of sizes N = 32, with K £ 2. .12. 

4.2 Performances 

Figure [3] and table Q] show performances of a cEA using CS on some QAP 
instances of various sizes. The instance in figure [5] is a well-known instance 
of size 30. The first fact that we notice when looking at these results is that 
there is an optimal setting, different from the extreme values and 1 for the 
parameter j3. This indicates that for a certain setting of the parameters, and 
thus for a certain selective pressure, the search dynamic leads to optimal results. 
Curves representing the instances summarized in table Q] have the same shape 
as figure [5j On each instance, the optimal value of /3 is around 0.85. The 
performances increase up to these values and then decrease. Performances of 
CS are significantly better than the one obtained with a cEA using standard 
binary tournament selection. The standard cEA is observable on the curve at 
the points f3 — 0.2 and is reported in the table [T] We can also notice in table Q] 
that the standard deviation is lower for the optimal value of /3 than with a cEA 
with binary tournament selection. 
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Figure 5: Performances on nug30 for a cEA using CS. 



Table 1: Avg. results and std.dev.on QAP instances 



Instance 


Std cGA 


Best avg. results 


Opt. 


Nug30 
Tai40a 
Sko42 
Tai50a 
Tai60a 


6178 [28] 
3.23 x 10j 14343 ] 

15969 [75] 

5.092 x 10f 20721] 

7429118 f27760l 


6144 [14] 
3.21 x 10[ 12000 ] 

15909 [34] 

5.080 x 10f 13372] 

7385390 fl9 39n 


0.88 
0.84 
0.82 
0.82 
0.86 



Figure [6] and table [2] present performances of a cEA with CS on some in- 
stances of NK landscapes. Parameters of the landscapes are N = 32 and K = 10 
for the figure [5] and are summarized in table [2] for the other instances. 

We can see that the shape of the performances' curve is different from the 
QAP curve. The performance increases until reaches its maximum value. 
The same results are obtained for all the instances in table [2j The parameter 
K tunes the difficulty of the instance. We can see that for K = 2, there is 
no optimal value for (3. The reason is that the optimum is always found. For 
K = 4, the standard cEA sometimes get stuck in a local optimum, and with 
= 1 our algorithm always find the optimum. On every instance, except K = 2, 
the optimal value for j3 is 1. 

However, this value /3 = 1 is a particular one, since it breaks all communica- 
tions on the grid. As long as the value of the parameter increases, the chances 
of selecting two different solutions for recombination decrease. For j3 — 1, the 
algorithm is the parallelisation of as much hill climbers as there are cells on the 
grid : It constantly selects the center cells of the neighborhoods, so there is no 
crossover and any amelioration is due to a bit flip. 
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Table 2: Avg. performances and std.dev.on NK instances with N = 32 



K 


Std cGA 


Best avg. results 


Best j3 


2 
4 
6 
8 

10 
12 


0.734329 [0] 

0.79597 [0 . 03] 
0.782934 [0 . 01] 

0.771277 [0 . i] 
0.763510 [o . O i] 
0.750043 [0 . O i] 


0.734329 [0] 
0.798197 [0] 
0.799124 [0 . 003] 
0.789103 [0 . 00 4] 
0.785115 [0 .oo3] 
0.774479 [0 .oo9] 


[0,1] 
1 
1 
1 
1 
1 



So the best setting for CS can be compared to the parallelisation of as many 
hill climbers as there are cells on the grid. The parallelisation of hill climbing 
seems to be a good algorithm for solving NK landscapes problems, which could 
be explained by the size of basins of attraction [21 and 15 . 



0.79 r 



0.785 




Figure 6: Performances on NK with N = 32 and K = 10 for a cEA using CS. 



4.3 Probabilities of discovering better solutions 

In order to explain the optimal values of (3 for QAP and NK-Landscape and 
to validate the PEM, we compute P, the probability of discovering a new best 
solution in the population taken from the PEM, with real data. We calculated 
it for one instance of QAP and one instance of NK-Landscapes. With this 
calculation we want to find the value of /3 that maximizes the probability of 
discovering a new solution. This probability depends on the value of Sy , and 
thus on time: if at a generation t no new solution is discovered, the actual 
best solution spreads in the population according to the selective pressure. If 
during an interval of time corresponding to the takeover time no new solution 
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is discovered then the population converges. We can compute the ideal /3 value 
for a given number of generations T because Sy(T) relies on j3 and on time: 
after T generations Sjj(T) is different according to (3, and for the optimal value 
of /3 Ejj(T) leads to the best probability P. 

We estimated the Sy with the same experiments done to compute growth 
curves and takeover time. We averaged the number of matings of each type 
at each generation over 10 3 runs. Then, we needed to know the probabilities 
-Poo, Pol an d Pn- We estimated these probabilities using a Bayesian process 
during the runs. We averaged the values obtained by generations over 500 
runs. Figure [7] shows the result of the estimation of probabilities on the QAP 
instance Nug30. The ordonate scale is logarithmic because of the variations of 
probabilities. The curves representing the Py intersect, so the value of /? which 
maximizes P may change during a run. We computed P with estimated values 
of Pij taken by steps of 50 generations. The values of Sy are also generation 
dependent. For each value of /J, we took the Sy value after 100 generations: 
that is £jj(100). During a run, it would correspond to allowing a stagnation 
period of 100 generations before stopping the run, which is reasonnable. 




Figure 7: Estimated Pij on QAP instance nug30 

The figure |S] shows the optimal values of /3 as a function of generations for 
the QAP instance Nug30. During the first 700 generations, the optimal value is 
/3 = 0.2. Then, there is a transition of approximatively 150 generations. During 
this phase, the ideal value of /3 grows until it reaches 1. In our experiments, we 
observe optimal values of/? between 0.8 and 0.9 according to the QAP instance. 
Values of /3 are constant during the runs. But the PEM shows that the selective 
pressure should be strong at the beginning (low values of /3) and then weak 
(high values of (3). If (3 is constant, intermediate values in the range [0.8,0.9] 
give the best average selective pressure for QAP instances. 

The figure [10] shows the optimal values of /3 as a function of generations for 
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Figure 8: Optimal values of j3 on QAP instance nug30 




Figure 9: Estimated Pij on NK-Landscape with N — 32 and K = 10 



a NK-Landscape with N = 32 and K = 10. We can see that the optimal /3 
value increases fastly and reaches 1 in the early generations. The ideal selective 
pressure is weak, and it is not surprising that the best performances are obtained 
when j3 = 1 in our experiments. Figure |H] shows the estimation of Py as a 
function of generations. We can see that the curve representing Pn drops down 
very fast. With a negligible probability of improving the current best solution 
with mutations, there is no sense in spreading this solution. With (3 = 1, the 
current best solution in the population cannot spread. 

The PEM has been used in order to explain the exploration / exploitation 
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Figure 10: Optimal values of f3 on NK-Landscape N — 32, K = 10 



trade-off on two different classes of problems. Coupled with the centric selection, 
it showed the ideal selective pressure along the search process. This model can 
be used to tune any parameter which has some influence on the number of 
matings of each type defined in the previous section. The computation cost 
is low, since the estimation of probabilities by a Bayesian process is precise: 
we averaged the estimation on 500 runs but the standard deviation was low 
(w 10~ 6 ).The Sjj are only computed once since they are independent from the 



optimization problem tackled. 
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5 Conclusion 

The exploration/exploitation trade-off is an important issue in evolutionary al- 
gorithms. In this paper, we propose a model that takes into account stochas- 
tic variations and improvement of the quality of the solutions, the punctuated 
equilibria model. In order to study the exploration / exploitation trade-off we 
propose a tunable selection operator: the centric selection. By monitoring the 
probability of selecting the center cell of neighborhoods for a tournament se- 
lection, this selection operator allows tuning accurately and continuously the 
selective pressure with one single parameter (/?). The performance results on 
Q AP instances and NK-Landscapes showed different optimal settings of the cen- 
tric selection, and thus different ideal selective pressures. Using the punctuated 
equilibria model, we put in evidence the optimal values of the centric selection's 
control parameter observed on QAP instances and NK-Landscapes. The punc- 
tuated equilibria model also put in evidence that the ideal selective pressure is 
not constant during the search process in the case of QAP instances. 

In this paper, we used the PEM in order to explain experimental results. In 
future works, we will use it in order to predict optimal exploration / exploitation 
trade-offs and to adapt the selective pressure during the runs. To do so, we will 
both estimate the Py and tune the selection operator online during the search 
process. The centric selection will be used in auto-adaptative algorithms with 
the advantage of modifying the exploration / exploitation ratio with a single 
parameter. It will also be applied on real problems and compared to other 
optimization methods. 
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